Unit 3 imp

Unit 3 theory

Q1. Define forecasting. What are the main goals and applications of forecasting in business and economics?

Forecasting:
Forecasting is the process of predicting future events or trends based on historical data, analysis, and models.

Main Goals of Forecasting:

1️⃣ Future Planning:
To estimate future demand, sales, or production needs.
2️⃣ Risk Reduction:
Helps to reduce uncertainty by providing data-based insights.
3️⃣ Better Decision-Making:
Assists managers and policymakers in making strategic choices.
4️⃣ Resource Allocation:
Helps in proper budgeting and resource management.
5️⃣ Performance Improvement:
Businesses can track and improve future performance by learning from trends.

Applications in Business and Economics:


Q2. Differentiate between qualitative and quantitative forecasting methods. Give two examples of each.

Qualitative Forecasting Methods

What is it?
These methods are based on human judgment, opinions, and intuition, not on historical data. They are used when past data is not available or when forecasting new products, trends, or situations.
When to use?

  1. Delphi Method – A panel of experts is asked to give their opinions anonymously. Their responses are collected, summarized, and shared in rounds until a consensus forecast is reached.
  2. Market Research – Surveys or interviews with customers to predict demand or trends based on what people say they will do or prefer.

Quantitative Forecasting Methods

What is it?
These methods use mathematical models and historical data to make forecasts. They rely on data patterns, trends, seasonality, and statistics.
When to use?

  1. Moving Average – Takes the average of previous values (like sales over last 3 months) to predict the next value.
  2. Exponential Smoothing – Assigns more weight to recent data and less to older data to forecast future values more accurately.
Summary:
Basis Qualitative Forecasting Quantitative Forecasting
Definition Based on expert opinions, intuition, or judgment. Based on mathematical models and historical data.
Data Required Does not require numerical data. Requires past numerical data for calculations.
Usefulness Useful when data is not available or future is uncertain. Useful when past trends can be analyzed.
Accuracy Less accurate; subjective. More accurate; objective and data-driven.
Examples 1. Delphi Method 1. Time Series Analysis
2. Regression Analysis

Conclusion:

Qualitative methods rely on opinions and experience, while quantitative methods use mathematical models and data for forecasting.



Q3. Explain the variate component method in time series analysis. What are its key components?

What is Variate Component Method?

It is a method in time series analysis where we break the time series data into parts (called components) so we can understand and analyze each part separately.

The Variate Component Method is a technique used in time series analysis to break down (decompose) a time series into its individual components to better understand patterns and make predictions.


Key Components of Time Series:

1️⃣ Trend (T):
It shows the long-term movement or direction of the data over time.
Slow and steady increase or decrease in data over a long time.
Example: Gradual increase in sales over years.

2️⃣ Seasonal Component (S):
It refers to regular patterns that repeat at fixed periods (like months or quarters).
Patterns that repeat regularly in a fixed time (like every year or month)
Example: Increase in ice-cream sales during summer.

3️⃣ Cyclical Component (C):
It refers to long-term up and down movements not of fixed period, usually related to the business cycle.
Ups and downs in data over a long time but not in a fixed pattern.
Example: Economic booms or recessions.

4️⃣ Irregular/Random Component (I):
It includes Unexpected or random changes in the data
Example: Sudden drop in sales due to natural disaster.

Models Used:


Conclusion:

The Variate Component Method helps to analyze and forecast time series by studying each component separately, improving understanding of the data behavior.


Q4. When would you choose an additive model over a multiplicative model in time series decomposition? Explain with examples.

Use Additive Model when:
Use Multiplicative Model when:
In short:

Q5. What is a stationary time series? Distinguish between strict stationarity and weak stationarity.

Stationary Time Series (Simple Explanation):

A stationary time series is one where the data’s key properties do not change over time — specifically:

Types of Stationarity:
1. Strict Stationarity:

Strict Stationarity) means that the entire distribution of a time series remains the same over time — not just the mean or variance, but all moments and joint distributions.

2. Weak Stationarity (also called Covariance Stationarity):

Weak Stationarity (also called Second-order Stationarity) means that a time series has constant statistical properties over time — specifically mean, variance, and covariance.

Key Differences:
Feature Strict Stationarity Weak Stationarity
Based on Full distribution Mean, variance, and covariance only
Strict conditions Yes Relaxed conditions
Usage in practice Rarely checked (hard to test) Often used in models like ARIMA

Q6. Define autocorrelation function (ACF). How does it help in identifying the structure of a time series?

Definition:

The Autocorrelation Function (ACF) measures the correlation between a time series and its past values (lags).
It tells us how much the present value of the series is related to its past values.

In simple words:
ACF shows how current data points are similar or related to previous ones over different time gaps (lags).


Formula:

SEM 2/Time Series/attachments/Pasted image 20250625115158.png

How ACF Helps in Time Series Analysis

🔍 1. Identifying Time Series Patterns:

🧱 2. Model Selection:

📉 3. Checking Stationarity:


Conclusion:

The Autocorrelation Function is a key tool in time series analysis. It helps to understand structure, choose models, and test if data is stationary or not.

Q7. What is a correlogram? How is it useful in time series analysis?

Definition:
A correlogram is a graphical representation of the autocorrelation function (ACF) of a time series. It shows how correlated the current value is with its past values (lags), plotted against the number of lags.

How it is Useful in Time Series Analysis:
  1. Detects Serial Correlation:
    Helps identify if data points are related to previous values.

  2. Checks Stationarity:
    If autocorrelations drop quickly (cut off), the series may be stationary. If they decay slowly, the series is likely non-stationary.

  3. Identifies Seasonality:
    Regular repeating spikes at specific lags suggest seasonal patterns.

  4. Model Selection:
    Helps decide the order of AR (AutoRegressive) or MA (Moving Average) components for ARIMA models.

  5. Visual Diagnostic Tool:
    Makes it easier to understand autocorrelation behavior instead of reading values from a table.

Example:

A correlogram showing strong spikes at lag 12, 24, etc., may indicate yearly seasonality in monthly sales data.


Note

simple exponential smoothing:
a method to forecast when data has no trend or seasonality. it gives more weight to recent values and less to older ones. used when data stays mostly stable over time.

holt’s double exponential smoothing:
an improved version that works when data has a trend. it smooths both the level and the trend to give better forecasts. used when values are steadily increasing or decreasing.
Here is a simple, exam-ready answer for 5 marks on Brown’s Discounted Regression:


Brown’s Discounted Regression

Definition:

Brown’s Discounted Regression is a forecasting method that gives more weight to recent data and less weight to older data using a discounting factor.

It is often used when recent trends are more important than older patterns, especially in changing environments.


Key Idea:

SEM 2/Time Series/attachments/Pasted image 20250625123304.png.


Formula (Conceptual):

SEM 2/Time Series/attachments/Pasted image 20250625123315.png


Uses:


Advantages:


Disadvantages:


Conclusion:

Brown’s Discounted Regression is a practical forecasting method that adjusts predictions by discounting older data, making it ideal for fast-changing time series.


Q8. Compare simple exponential smoothing and Holt’s double exponential smoothing. When is each method appropriate?

Comparison: Simple Exponential Smoothing vs. Holt’s Double Exponential Smoothing

Feature Simple Exponential Smoothing (SES) Holt’s Double Exponential Smoothing
Trend Handling Does not handle trend Handles linear trend
Components Used Only level Level + Trend
Complexity Simple to apply More complex (uses two smoothing constants)
Forecast Equation Based on weighted average of past values Adds a trend term to the forecast
When to Use For stationary data (no trend/seasonality) For data with a trend (but no seasonality)
When is Each Method Appropriate?
Example:

Q9. Discuss any two exponential smoothing methods to forecast the future data.

1. simple exponential smoothing:

this method is used when the data has no trend or seasonality. it gives more weight to recent values and less to older ones using a smoothing factor called alpha (α), which lies between 0 and 1.
the formula is:
forecast = α × current actual value + (1 − α) × previous forecast
if α is close to 1, more weight is given to the recent data.
this method is easy to apply and best suited for short-term forecasts where data is steady without much change.
example:
forecasting daily demand for bread in a shop where sales are usually around 100 units with small variations.
advantages:

2. holt’s double exponential smoothing:

this is an extension of simple exponential smoothing and is used when the data has a trend (either upward or downward). it uses two equations — one for the current level and one for the trend.
it uses two smoothing constants:


Unit 3 - Numerical

Q. SEM 2/Time Series/attachments/Pasted image 20250625113426.png

Autocorrelation Calculation for Time Series

Given Data

Observations: 34, 24, 23, 31, 38, 34, 35, 31, 29, 28 n = 10 observations

Step 1: Calculate the Mean

x̄ = (34 + 24 + 23 + 31 + 38 + 34 + 35 + 31 + 29 + 28) ÷ 10 x̄ = 307 ÷ 10 = 30.7

Step 2: Calculate Deviations from Mean

t xt xt - x̄ (xt - x̄)²
1 34 3.3 10.89
2 24 -6.7 44.89
3 23 -7.7 59.29
4 31 0.3 0.09
5 38 7.3 53.29
6 34 3.3 10.89
7 35 4.3 18.49
8 31 0.3 0.09
9 29 -1.7 2.89
10 28 -2.7 7.29

Sum of squared deviations = 208.1

Step 3: Calculate Sample Variance

s² = Σ(xt - x̄)² ÷ (n-1) = 208.1 ÷ 9 = 23.12

Step 4: Calculate Autocorrelations

For r₁ (lag 1):

Products: (xt - x̄)(xt+1 - x̄) for t = 1 to 9

t (xt - x̄) (xt+1 - x̄) Product
1 3.3 -6.7 -22.11
2 -6.7 -7.7 51.59
3 -7.7 0.3 -2.31
4 0.3 7.3 2.19
5 7.3 3.3 24.09
6 3.3 4.3 14.19
7 4.3 0.3 1.29
8 0.3 -1.7 -0.51
9 -1.7 -2.7 4.59

Sum = 72.01 r₁ = 72.01 ÷ (9 × 23.12) = 0.346

For r₂ (lag 2):

Products: (xt - x̄)(xt+2 - x̄) for t = 1 to 8

t (xt - x̄) (xt+2 - x̄) Product
1 3.3 -7.7 -25.41
2 -6.7 0.3 -2.01
3 -7.7 7.3 -56.21
4 0.3 3.3 0.99
5 7.3 4.3 31.39
6 3.3 0.3 0.99
7 4.3 -1.7 -7.31
8 0.3 -2.7 -0.81

Sum = -58.38 r₂ = -58.38 ÷ (8 × 23.12) = -0.316

For r₃ (lag 3):

Products: (xt - x̄)(xt+3 - x̄) for t = 1 to 7

t (xt - x̄) (xt+3 - x̄) Product
1 3.3 0.3 0.99
2 -6.7 7.3 -48.91
3 -7.7 3.3 -25.41
4 0.3 4.3 1.29
5 7.3 0.3 2.19
6 3.3 -1.7 -5.61
7 4.3 -2.7 -11.61

Sum = -87.07 r₃ = -87.07 ÷ (7 × 23.12) = -0.538

For r₄ (lag 4):

Products: (xt - x̄)(xt+4 - x̄) for t = 1 to 6

t (xt - x̄) (xt+4 - x̄) Product
1 3.3 7.3 24.09
2 -6.7 3.3 -22.11
3 -7.7 4.3 -33.11
4 0.3 0.3 0.09
5 7.3 -1.7 -12.41
6 3.3 -2.7 -8.91

Sum = -52.36 r₄ = -52.36 ÷ (6 × 23.12) = -0.377

For r₅ (lag 5):

Products: (xt - x̄)(xt+5 - x̄) for t = 1 to 5

t (xt - x̄) (xt+5 - x̄) Product
1 3.3 3.3 10.89
2 -6.7 4.3 -28.81
3 -7.7 0.3 -2.31
4 0.3 -1.7 -0.51
5 7.3 -2.7 -19.71

Sum = -40.45 r₅ = -40.45 ÷ (5 × 23.12) = -0.350

Final Results:

Formula Used:

rₖ = Σ(xt - x̄)(xt+k - x̄) ÷ [(n-k) × s²]

where:

SEM 2/Time Series/attachments/Pasted image 20250625113552.png
SEM 2/Time Series/attachments/Pasted image 20250625113612.png

Q1. Given a time series Y = [5, 6, 4, 7, 6, 5, 4, 6, 5, 6], compute the mean and variance. Verify if the mean and variance are constant.

Given Data

Time Series Y = [5, 6, 4, 7, 6, 5, 4, 6, 5, 6] n = 10 observations


Step 1: Calculate Overall Mean and Variance

Mean Calculation Table

t Yₜ
1 5
2 6
3 4
4 7
5 6
6 5
7 4
8 6
9 5
10 6
Total 54

Overall Mean (Ȳ) = 54 ÷ 10 = 5.4

Variance Calculation Table

t Yₜ (Yₜ - Ȳ) (Yₜ - Ȳ)²
1 5 -0.4 0.16
2 6 0.6 0.36
3 4 -1.4 1.96
4 7 1.6 2.56
5 6 0.6 0.36
6 5 -0.4 0.16
7 4 -1.4 1.96
8 6 0.6 0.36
9 5 -0.4 0.16
10 6 0.6 0.36
Total 54 0 8.40

Variance (σ²) = Σ(Yₜ - Ȳ)² ÷ n = 8.40 ÷ 10 = 0.84


Step 2: Test for Constant Mean (Stationarity in Mean)

Method: Split Series into Two Halves

First Half (t = 1 to 5): Y₁ = [5, 6, 4, 7, 6] Second Half (t = 6 to 10): Y₂ = [5, 4, 6, 5, 6]

Mean Comparison Table

Period Observations Sum Mean
First Half (1-5) 5, 6, 4, 7, 6 28 5.6
Second Half (6-10) 5, 4, 6, 5, 6 26 5.2
Overall All 10 values 54 5.4

Difference in Means = |5.6 - 5.2| = 0.4


Step 3: Test for Constant Variance (Stationarity in Variance)

Variance of Each Half

First Half Variance:

t Y�t (Yₜ - Ȳ₁) (Yₜ - Ȳ₁)²
1 5 -0.6 0.36
2 6 0.4 0.16
3 4 -1.6 2.56
4 7 1.4 1.96
5 6 0.4 0.16
Total 28 0 5.20

σ₁² = 5.20 ÷ 5 = 1.04

Second Half Variance:

t Yₜ (Yₜ - Ȳ₂) (Yₜ - Ȳ₂)²
6 5 -0.2 0.04
7 4 -1.2 1.44
8 6 0.8 0.64
9 5 -0.2 0.04
10 6 0.8 0.64
Total 26 0 2.80

σ₂² = 2.80 ÷ 5 = 0.56

Variance Comparison Table

Period Variance
First Half (1-5) 1.04
Second Half (6-10) 0.56
Overall 0.84

Difference in Variances = |1.04 - 0.56| = 0.48


Step 4: Results and Conclusion

Summary Table

Parameter Overall First Half Second Half Difference
Mean 5.4 5.6 5.2 0.4
Variance 0.84 1.04 0.56 0.48

Conclusion:

  1. Mean Constancy: The difference between first and second half means is 0.4, which is relatively small (7.4% of overall mean). The mean appears approximately constant.

  2. Variance Constancy: The difference between first and second half variances is 0.48, which is significant (57% difference). The variance shows notable variation between periods.

  3. Overall Assessment: The time series shows reasonable stability in mean but significant variation in variance, indicating it may not be perfectly stationary.


Final Answer:


Q2. For the series Y = [5, 6, 4, 7, 6, 5, 4, 6, 5, 6], compute autocovariance and autocorrelation at lag 1.

Given Data

Yields: 47, 64, 23, 71, 38, 64, 55, 41, 59, 48 n = 10 observations


Step 1: Calculate Mean

Mean Calculation Table

t Yₜ
1 47
2 64
3 23
4 71
5 38
6 64
7 55
8 41
9 59
10 48
Total 510

Mean (Ȳ) = 510 ÷ 10 = 51


Step 2: Calculate Deviations and Squared Deviations

Deviation Calculation Table

t Yₜ (Yₜ - Ȳ) (Yₜ - Ȳ)²
1 47 -4 16
2 64 13 169
3 23 -28 784
4 71 20 400
5 38 -13 169
6 64 13 169
7 55 4 16
8 41 -10 100
9 59 8 64
10 48 -3 9
Total 510 0 1896

Step 3: Calculate Autocovariance at Lag 0

C₀ = Σ(Yₜ - Ȳ)² ÷ n = 1896 ÷ 10 = 189.6


Step 4: Calculate Autocovariance at Lag 1 (C₁)

Lag 1 Autocovariance Table

For C₁, we need (Yₜ - Ȳ)(Yₜ₊₁ - Ȳ) for t = 1 to 9

t Yₜ Yₜ₊₁ (Yₜ - Ȳ) (Yₜ₊₁ - Ȳ) (Y�t - Ȳ)(Yₜ₊₁ - Ȳ)
1 47 64 -4 13 -52
2 64 23 13 -28 -364
3 23 71 -28 20 -560
4 71 38 20 -13 -260
5 38 64 -13 13 -169
6 64 55 13 4 52
7 55 41 4 -10 -40
8 41 59 -10 8 -80
9 59 48 8 -3 -24
Total -1497

C₁ = Σ(Yₜ - Ȳ)(Yₜ₊₁ - Ȳ) ÷ (n-1) = -1497 ÷ 9 = -166.33


Step 5: Calculate Autocovariance at Lag 2 (C₂)

Lag 2 Autocovariance Table

For C₂, we need (Yₜ - Ȳ)(Yₜ₊₂ - Ȳ) for t = 1 to 8

t Yₜ Yₜ₊₂ (Yₜ - Ȳ) (Yₜ₊₂ - Ȳ) (Yₜ - Ȳ)(Yₜ₊₂ - Ȳ)
1 47 23 -4 -28 112
2 64 71 13 20 260
3 23 38 -28 -13 364
4 71 64 20 13 260
5 38 55 -13 4 -52
6 64 41 13 -10 -130
7 55 59 4 8 32
8 41 48 -10 -3 30
Total 876

C₂ = Σ(Yₜ - Ȳ)(Yₜ₊₂ - Ȳ) ÷ (n-2) = 876 ÷ 8 = 109.5


Step 6: Calculate Autocorrelation Coefficients

Autocorrelation Formula

ρₖ = Cₖ ÷ C₀

Autocorrelation Calculation Table

Lag (k) Autocovariance (Cₖ) Autocorrelation (ρₖ)
0 189.6 1.000
1 -166.33 -0.877
2 109.5 0.578

Calculations:


Step 7: Summary of Results

Final Results Table

Parameter Value
Mean (Ȳ) 51
Autocovariance at lag 0 (C₀) 189.6
Autocovariance at lag 1 (C₁) -166.33
Autocovariance at lag 2 (C₂) 109.5
Autocorrelation at lag 0 (ρ₀) 1.000
Autocorrelation at lag 1 (ρ₁) -0.877
Autocorrelation at lag 2 (ρ₂) 0.578

Key Formulas Used:

  1. Mean: Ȳ = ΣYₜ ÷ n

  2. Autocovariance: Cₖ = Σ(Yₜ - Ȳ)(Yₜ₊ₖ - Ȳ) ÷ (n-k)

  3. Autocorrelation: ρₖ = Cₖ ÷ C₀


Interpretation:

Q3. Apply simple exponential smoothing with α = 0.5 to the series Y = [50, 52, 53, 54, 56]. The initial forecast is 50. Provide forecast for t = 6.

we are given:

step-by-step calculations:

F₁ = 50 (given)

t = 2:
F₂ = 0.5 × 50 + 0.5 × 50 = 25 + 25 = 50

t = 3:
F₃ = 0.5 × 52 + 0.5 × 50 = 26 + 25 = 51

t = 4:
F₄ = 0.5 × 53 + 0.5 × 51 = 26.5 + 25.5 = 52

t = 5:
F₅ = 0.5 × 54 + 0.5 × 52 = 27 + 26 = 53

t = 6 (forecast):
F₆ = 0.5 × 56 + 0.5 × 53 = 28 + 26.5 = 54.5

final answer:

forecast for t = 6 is 54.5


Q4. Using Holt’s double exponential smoothing, compute the forecast for t = 4 given Y = [100, 108, 117], α = 0.6, β = 0.4.

we are given:

holt's method uses two equations:

step 1: initialize (t = 1)

L₁ = Y₁ = 100
T₁ = Y₂ − Y₁ = 108 − 100 = 8

step 2: t = 2

Y₂ = 108
L₂ = α × Y₂ + (1 − α) × (L₁ + T₁)
L₂ = 0.6 × 108 + 0.4 × (100 + 8) = 64.8 + 0.4 × 108 = 64.8 + 43.2 = 108

T₂ = β × (L₂ − L₁) + (1 − β) × T₁
T₂ = 0.4 × (108 − 100) + 0.6 × 8 = 0.4 × 8 + 4.8 = 3.2 + 4.8 = 8

step 3: t = 3

Y₃ = 117
L₃ = 0.6 × 117 + 0.4 × (108 + 8) = 70.2 + 0.4 × 116 = 70.2 + 46.4 = 116.6
T₃ = 0.4 × (116.6 − 108) + 0.6 × 8 = 0.4 × 8.6 + 4.8 = 3.44 + 4.8 = 8.24

step 4: forecast for t = 4

F₄ = L₃ + T₃ = 116.6 + 8.24 = 124.84
final answer: forecast for t = 4 is 124.84


Q5. Given quarterly data Y = [30, 21, 29, 36, 42, 33, 41, 48], apply additive Holt-Winters method to compute L5, T5, S5 and forecast for t = 9. Use α = 0.5, β = 0.4, γ = 0.3, s = 4.

Holt-Winters Additive Method Solution

Given Data

Step 1: Initialize Parameters

Initial Level (L₁)

For additive model, we use the average of first season:
L₁ = (Y₁ + Y₂ + Y₃ + Y₄)/4 = (30 + 21 + 29 + 36)/4 = 116/4 = 29
Why: The initial level represents the deseasonalized average of the first complete seasonal cycle.

Initial Trend (T₁)

T₁ = [(Y₅ + Y₆ + Y₇ + Y₈) - (Y₁ + Y₂ + Y₃ + Y₄)]/(2s)
T₁ = [(42 + 33 + 41 + 48) - (30 + 21 + 29 + 36)]/(2×4)
T₁ = [164 - 116]/8 = 48/8 = 6

Why: The initial trend is calculated as the difference between the averages of two consecutive seasons, divided by 2s to get the trend per period.

Initial Seasonal Indices (S₁, S₂, S₃, S₄)

For additive model: Sᵢ = Yᵢ - L₁

Step 2: Apply Holt-Winters Equations

The three equations for additive model are:

For t = 2:

Level: L₂ = α(Y₂ - S₂₋₄) + (1-α)(L₁ + T₁)
Since we don't have S₋₂, we use S₂ = -8
L₂ = 0.5(21 - (-8)) + 0.5(29 + 6) = 0.5(29) + 0.5(35) = 14.5 + 17.5 = 32
Trend: T₂ = β(L₂ - L₁) + (1-β)T₁ = 0.4(32 - 29) + 0.6(6) = 0.4(3) + 3.6 = 4.8
Seasonal: S₂ = γ(Y₂ - L₂) + (1-γ)S₂ = 0.3(21 - 32) + 0.7(-8) = 0.3(-11) + (-5.6) = -8.9

For t = 3:

Level: L₃ = 0.5(29 - 0) + 0.5(32 + 4.8) = 14.5 + 18.4 = 32.9
Trend: T₃ = 0.4(32.9 - 32) + 0.6(4.8) = 0.36 + 2.88 = 3.24
Seasonal: S₃ = 0.3(29 - 32.9) + 0.7(0) = 0.3(-3.9) = -1.17

For t = 4:

Level: L₄ = 0.5(36 - 7) + 0.5(32.9 + 3.24) = 14.5 + 18.07 = 32.57
Trend: T₄ = 0.4(32.57 - 32.9) + 0.6(3.24) = -0.132 + 1.944 = 1.812
Seasonal: S₄ = 0.3(36 - 32.57) + 0.7(7) = 1.029 + 4.9 = 5.929

For t = 5:

Level: L₅ = 0.5(42 - 1) + 0.5(32.57 + 1.812) = 20.5 + 17.191 = 37.691
Trend: T₅ = 0.4(37.691 - 32.57) + 0.6(1.812) = 2.0484 + 1.0872 = 3.1356
Seasonal: S₅ = 0.3(42 - 37.691) + 0.7(1) = 1.2927 + 0.7 = 1.9927

Step 3: Results for t = 5

Step 4: Forecast for t = 9

For additive Holt-Winters, the forecast equation is:
Ŷₜ₊ₕ = Lₜ + h×Tₜ + Sₜ₊ₜ₋ₛ
Where:

For t = 6:

L₆ = 0.5(33 - (-8.9)) + 0.5(37.691 + 3.136) = 20.95 + 20.4135 = 41.3635
T₆ = 0.4(41.3635 - 37.691) + 0.6(3.136) = 1.469 + 1.8816 = 3.3506

For t = 7:

L₇ = 0.5(41 - (-1.17)) + 0.5(41.3635 + 3.3506) = 21.085 + 22.357 = 43.442
T₇ = 0.4(43.442 - 41.3635) + 0.6(3.3506) = 0.8314 + 2.0104 = 2.8418

For t = 8:

L₈ = 0.5(48 - 5.929) + 0.5(43.442 + 2.8418) = 21.0355 + 23.142 = 44.1775
T₈ = 0.4(44.1775 - 43.442) + 0.6(2.8418) = 0.294 + 1.7051 = 1.9991

Forecast for t = 9:

Ŷ₉ = L₈ + 1×T₈ + S₅
Ŷ₉ = 44.1775 + 1.9991 + 1.993 = 48.17

Final Answer


Q6. Apply Brown’s discounted regression (double exponential smoothing) with α = 0.5 to Y = [50, 52, 53, 54]. Compute level, trend, and forecast for t = 5.

Problem:
Apply Brown’s double exponential smoothing (a.k.a. discounted regression)
Given:

Step 1: What is Brown’s Double Exponential Smoothing?

This method is used when the time series has a trend, but no seasonality.
It smooths the data twice to estimate:

Step 2: Formulas Used

These are the core formulas:
4. S₁(t) = α × Yₜ + (1 − α) × S₁(t−1)
→ smooths the raw data once
5. S₂(t) = α × S₁(t) + (1 − α) × S₂(t−1)
→ smooths the smoothed data (S₁) again
6. Level (aₜ) = 2 × S₁(t) − S₂(t)
→ estimates current value based on double smoothing
7. Trend (bₜ) = (α / (1 − α)) × (S₁(t) − S₂(t))
→ measures the slope of the data (rate of change)
8. Forecast (Fₜ₊ₕ) = aₜ + bₜ × h
→ future forecast h steps ahead

Step 3: Initialization (Why?)

At t = 1, we need starting values.
We start by setting:

Step 4: Apply the Smoothing Equations (Why?)

Now apply formulas from t = 2 to t = 4 to calculate S₁ and S₂.

At t = 2 (Y₂ = 52):

S₁(2) = 0.5 × 52 + 0.5 × 50 = 51
→ Why? Smooth the actual value (52) with the previous smoothed value (50)

S₂(2) = 0.5 × 51 + 0.5 × 50 = 50.5
→ Why? Smooth S₁(2) using S₂(1)

At t = 3 (Y₃ = 53):

S₁(3) = 0.5 × 53 + 0.5 × 51 = 52
S₂(3) = 0.5 × 52 + 0.5 × 50.5 = 51.25

At t = 4 (Y₄ = 54):

S₁(4) = 0.5 × 54 + 0.5 × 52 = 53
S₂(4) = 0.5 × 53 + 0.5 × 51.25 = 52.125

Step 5: Calculate Level and Trend at t = 4 (Why?)

Now that we’ve smoothed the data, we estimate:

Level (a₄):

a₄ = 2 × S₁(4) − S₂(4) = 2 × 53 − 52.125 = 53.875
→ Why? This estimates the base value by adjusting for double smoothing.

Trend (b₄):

b₄ = (α / (1 − α)) × (S₁(4) − S₂(4))
= (0.5 / 0.5) × (53 − 52.125) = 1 × 0.875 = 0.875
→ Why? Measures how much the series is rising each step.

Step 6: Forecast for t = 5 (Why?)

We use:
F₅ = a₄ + b₄ × 1 = 53.875 + 0.875 = 54.75
→ Why? We're projecting one time step into the future using the current level + one unit of the trend.

Final Answer:


Q7. Given AR(1) model Yt = 0.8Yt−1 + ϵt , with Y0 = 10 and ϵt = [0.5, −0.3, 1.0, −0.2, 0.1], compute Y1 to Y5.

AR(1) Model Step-by-Step Solution

Given Information

Understanding the AR(1) Model

What is AR(1)?

AR(1) stands for Autoregressive model of order 1, which means:

Model Components Explained:

Why This Formula Works:

The equation Yₜ = 0.8Yₜ₋₁ + εₜ tells us:
9. 80% of current value comes from the previous value (0.8Yₜ₋₁)
10. 20% is new information from the error term (εₜ)
11. Since 0.8 < 1, the series will gradually return to zero if no shocks occur

Step-by-Step Calculations

Step 1: Calculate Y₁

Given: Y₀ = 10, ε₁ = 0.5
Formula: Y₁ = 0.8Y₀ + ε₁ Calculation: Y₁ = 0.8 × 10 + 0.5 = 8.0 + 0.5 = 8.5
Why this step: We start with the initial condition Y₀ = 10 and apply the AR(1) formula to get the first forecasted value. The 0.8 coefficient means we take 80% of the previous value, then add the random shock.

Step 2: Calculate Y₂

Given: Y₁ = 8.5, ε₂ = -0.3
Formula: Y₂ = 0.8Y₁ + ε₂ Calculation: Y₂ = 0.8 × 8.5 + (-0.3) = 6.8 - 0.3 = 6.5
Why this step: Now Y₁ becomes our "previous value" and we apply the same logic. The negative error term (-0.3) pushes the value down from what it would have been with just the autoregressive component.

Step 3: Calculate Y₃

Given: Y₂ = 6.5, ε₃ = 1.0
Formula: Y₃ = 0.8Y₂ + ε₃ Calculation: Y₃ = 0.8 × 6.5 + 1.0 = 5.2 + 1.0 = 6.2
Why this step: The positive shock (1.0) is relatively large, boosting the value significantly above what the autoregressive component alone would give us (5.2).

Step 4: Calculate Y₄

Given: Y₃ = 6.2, ε₄ = -0.2
Formula: Y₄ = 0.8Y₃ + ε₄ Calculation: Y₄ = 0.8 × 6.2 + (-0.2) = 4.96 - 0.2 = 4.76
Why this step: The negative shock (-0.2) pulls the value down slightly from the autoregressive component (4.96).

Step 5: Calculate Y₅

Given: Y₄ = 4.76, ε₅ = 0.1
Formula: Y₅ = 0.8Y₄ + ε₅ Calculation: Y₅ = 0.8 × 4.76 + 0.1 = 3.808 + 0.1 = 3.908
Why this step: The small positive shock (0.1) slightly increases the value from the autoregressive component.

Summary of Results

Time (t) Previous Value (Yₑ₋₁) Error (εₜ) AR Component (0.8Yₜ₋₁) Final Value (Yₜ)
1 10.000 0.5 8.000 8.500
2 8.500 -0.3 6.800 6.500
3 6.500 1.0 5.200 6.200
4 6.200 -0.2 4.960 4.760
5 4.760 0.1 3.808 3.908

Key Observations

Pattern Analysis:

  1. Mean Reversion: Without error terms, the series would decay toward 0 (since 0.8 < 1)
  2. Shock Impact: Error terms create deviations from the smooth decay pattern
  3. Persistence: Each value still carries 80% of the previous period's influence

Why AR(1) is Useful:

Final Answer: